NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Unlocking the Unusable: A Proactive Caching Framework for Reusing Partial Overlapped Data

https://doi.org/10.1145/3736548.3737839

Guo, Chang; Podhorszki, Norbert; Eisenhauer, Greg; Xie, Zhiwen; Klasky, Scott; Cao, Zhichao (July 2025, ACM)

Cache systems are widely used to speed up data retrieving. Modern HPC, data analytics, and AI/ML workloads generate vast, multi-dimensional datasets, and those data are accessed via complex queries. However, the probability of requesting the exact same data across different queries is low, leading to limited performance improvement when a traditional key-value cache is applied. In this paper, we present Mosaic-Cache, a proactive and general caching framework that enables applications with efficient partial overlapped data reuse through novel overlap-aware cache interfaces for fast content-level reuse. The core components include a metadata manager leveraging customizable indexing for fast overlap lookups, an adaptive fetch planner for dynamic cache-to-storage decisions, and an async merger to reduce cache fragmentation and redundancy. Evaluations on real-world HPC datasets show that Mosaic-Cache improves overall performance by up to 4.1× over traditional key-value-based cache while adding minimal overhead in worst-case scenarios.
more » « less
Full Text Available
LegoIndex: A Scalable and Modular Indexing Framework for Efficient Analysis of Extreme-Scale Particle Data

https://doi.org/10.1145/3731545.3731591

Guo, Chang; Yan, Ning; Wan, Lipeng; Cao, Zhichao (July 2025, ACM)

Full Text Available
Targeted delivery of TGF-β mRNA to murine lung parenchyma using one-component ionizable amphiphilic Janus Dendrimers

https://doi.org/10.1038/s41467-025-56448-y

Meshanni, Jaclynn A; Stevenson, Emily R; Zhang, Dapeng; Sun, Rachel; Ona, Nathan A; Reagan, Erin K; Abramova, Elena; Guo, Chang-Jiang; Wilkinson, Melissa; Baboo, Ishana; et al (December 2025, Nature Communications)

Full Text Available
Can ZNS SSDs be Better Storage Devices for Persistent Cache?

https://doi.org/10.1145/3655038.3665946

Yang, Chongzhuo; Cao, Zhang; Guo, Chang; Zhao, Ming; Cao, Zhichao (July 2024, ACM)

Full Text Available
CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure

https://doi.org/10.1145/3654927

Yu, Qiaolin; Guo, Chang; Zhuang, Jay; Thakkar, Viraj; Wang, Jianguo; Cao, Zhichao (May 2024, Proceedings of the ACM on Management of Data)

Optimizing LSM-based Key-Value Stores (LSM-KVS) for disaggregated storage is essential to achieve better resource utilization, performance, and flexibility. Most of the existing studies focus on offloading the compaction to the storage nodes to mitigate the performance penalties caused by heavy network traffic between computing and storage. However, several critical issues are not addressed including the strong dependency between offloaded compaction and LSM-KVS, resource load-balancing, compaction scheduling, and complex transient errors. To address the aforementioned issues and limitations, in this paper, we propose CaaS-LSM, a novel disaggregated LSM-KVS with a new idea of Compaction-as-a-Service. CaaS-LSM brings three key contributions. First, CaaS-LSM decouples the compaction from LSM-KVS and achieves stateless execution to ensure high flexibility and avoid coordination overhead with LSM-KVS. Second, CaaS-LSM introduces a performance- and resource-optimized control plane to guarantee better performance and resource utilization via an adaptive run-time scheduling and management strategy. Third, CaaS-LSM addresses different levels of transient and execution errors via sophisticated error-handling logic. We implement the prototype of CaaS-LSM based on RocksDB and evaluate it with different LSM-based distributed databases (Kvrocks and Nebula). In the storage disaggregated setup, CaaS-LSM achieves up to 8X throughput improvement and reduces the P99 latency up to 98% compared with the conventional LSM-KVS, and up to 61% of improvement compared with state-of-the-art LSM-KVS optimized for disaggregated storage.
more » « less
Full Text Available
Pd-Catalyzed Asymmetric Amination of Enamines: Expedient Synthesis of Structurally Diverse N–C Atropisomers

https://doi.org/10.1021/acscatal.3c00732

Zhang, Peng; Guo, Chang-Qiu; Yao, Wang; Lu, Chuan-Jun; Li, Yingzi; Paton, Robert S.; Liu, Ren-Rong (June 2023, ACS Catalysis)

Full Text Available

Search for: All records